A g- ANALOGUE OF DE FINETTI'S THEOREM 



ALEXANDER GNEDIN AND GRIGORI OLSHANSKI 

Abstract. A q-analogue of de Finetti's theorem is obtained in terms of a boundary 
problem for the g-Pascal graph. For q a power of prime this leads to a characterisation 
of random spaces over the Galois field ¥ q that are invariant under the natural action of 
the infinite group of invertible matrices with coefficients from ¥ q . 



1. Introduction 

The infinite symmetric group consists of bijections {1, 2, ... } — > {1,2,...} which 
move only finitely many integers. The group Oqo acts on the product space {0, 1}°° by 
permutations of the coordinates. A random element of this space, that is a random infinite 
binary sequence, is called exchangeable if its probability law is invariant under the action 
of &oo- De Finetti's theorem asserts that every exchangeable sequence can be generated 
in a unique way by the following two-step procedure: first choose at random the value 
of parameter p from some probability distribution on the unit interval [0, 1], then run an 
infinite Bernoulli process with probability p for l's. 

One approach to this classical result, as presented in Feller [31 Ch. VII, §4], is based on 
the following exciting connection with the Hausdorff moment problem. By exchangeabil- 
ity, the law of a random infinite binary sequence is determined by the array (v nj k) , where 
v nt k equals the probability of every initial sequence of length n with k l's. The rule of 
addition of probabilities yields the backward recursion 

v n ,k = v n+ljk + v n+ljk+1 , < k < n, n = 0, 1, . . . , (1) 

which readily implies that the array can be derived by iterated differencing of the sequence 
(^n,o)n=o,i,..- Specifically, setting 

uf ) = vi +k ,k, 1 = 0,1,..., k = 0,l,..., (2) 
and denoting by 5 the difference operator acting on sequences u = (wz)z=o,i,... as 

(8u)i = ui- u i+ i, 

the recursion ([T|) can be written as 

= k=l,2,.... (3) 
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Since v n ^ > 0, the sequence must be completely monotone, that is, componentwise 

^■ ■ ■o^ (0) > 0, A; = 0,1,..., 

k 

but then Hausdorff 's theorem implies that there exists a representation 

,( fc ) _ f Jfc 



/ P fc (i-prv(dp) (4) 

J [0,1] 

with uniquely determined probability measure [l. De Finetti's theorem follows since v n ^ = 
p (1 — p) n ~ k for the Bernoulli process with parameter p. See [1] for other proofs and 
extensive survey of generalisations of this result. 

The present note is devoted to variations on the g-analogue of de Finetti's theorem, 
which was briefly outlined in Kerov [§] within the framework of the boundary problem for 
generalised Stirling triangles. The boundary problem for other weighted versions of the 
Pascal triangle was studied in [I] , [6] , and for more general graded graphs in [5] , [9] , [10] . 

Definition 1.1. Given q > 0, let us say that a random binary sequence e = (e±, £2, . . . ) G 
{0, 1} 00 is q- exchangeable if its probability law P is ©oo-quasiinvariant with a specific co- 
cycle, which is uniquely determined by the following condition: Denoting by P(ei, . . . , e n ) 
the probability of an initial sequence {s\, . . . , e n ), we have for any i = 1, . . . , n — 1 

P(ei, . . • , £i-i, £i, £j+2) • • • j £ n) — (f l El+1 P(£i, • • • , e n )- 

In words: under an elementary transposition of the form (...,1,0,...) — > (. . . , 0, 1, . . .), 
probability is multiplied by q. 

Theorem 1.2. Assume < q < 1. There is a bijective correspondence P <-> // between the 
probability laws P of infinite q- exchangeable binary sequences and the probability measures 
fi on the closed countable set 

A q :={l,g,g 2 ,...}U{0}c [0,1]. 

More precisely, a g-exchangeable sequence can be generated in a unique way by first 
choosing at random a point iGA, distributed according to \x and then running a certain 
g-analogue of the Bernoulli process indexed by x. Each law P is uniquely determined by 
the infinite triangular array 

v nM :=P(1,...,1,0,...,0), 0<A;<n<oo, (5) 

k n—k 

which in turn is given by a q- version of formula (jl]), with A being replaced by A q (Theorem 
13. 2ft . A similar result with switching the roles of 0's and l's and replacing q by q~ l also 
holds for q > 1. 

The rest of the paper is organized as follows. In Section [2] we introduce the g-Pascal 
graph and formulate the (/-exchangeability in terms of certain Markov chains on this 
graph. In Section [3] we find a characteristic recursion for the numbers (JSJ), which is a 
g-deformation of (JTJ), and we prove the main result, equivalent to Theorem 11.21 using the 
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method of [10] . In Section H] we discuss three examples: two g-analogues of the Bernoulli 
process and a g-analogue of Polya's urn process. Finally, in Section for q a power of a 
prime number, we provide an interpretation of the theorem in terms of random subspaces 
in an infinite-dimensional vector space over ¥ q . 

2. The ^-Pascal graph 

For q > 0, the g-Pascal graph is a weighted directed graph T(q) on the infinite vertex 
set 



Each vertex (I, k) has two weighted outgoing edges (/, k) — > (l + l, k) and (/, k) — > (I, k+ 1) 
with weights 1 and q , respectively. The vertex set is divided into levels T n = {(l,k) : 
I + k = n}, so T = U n >oP n with r consisting of the sole root vertex (0, 0). For a path in 
T connecting two vertices (I, k) G Ti +k and (A, x) G T\ + „ we define the weight to be the 
product of weights of edges along the path. For instance, the weight of (2, 3) — > (2, 4) — > 
(3, 4) — > (3, 5) is q 5 = q 2 ■ 1 ■ q 3 . Clearly, such a path exists if and only if A > I, x > k. 

We shall consider certain transient Markov chains S = (S n ), with state-space T, which 
start at the root (0, 0) and move along the directed edges, so that S n G T n for every 
n = 0, 1, . . . . Thus, a trajectory of S is an infinite directed path in T started at the root. 

Definition 2.1. Adopting the terminology introduced by Vershik and Kerov (see [H]), we 
say that a Markov chain S on T(q) is central if the following condition is satisfied for each 
vertex (n — k,k) G T n visited by S with positive probability: given S n = (n — k, k), the 
conditional probability that S follows each particular path connecting (0, 0) and (n — k, k) 
is proportional to the weight of the path. 

Remark 2.2. If we only require the centrality condition to hold for all (l,k) G T u for 
fixed u, then we have it satisfied also for all (I, k) with I + k < v. From this it is easy 
to see that the centrality condition implies the Markov property of S in reversed time 
n = ... ,1,0, hence also implies the Markov property in forward time n = 0, 1, . . .. 

In the special case q — 1 Definition 12. II means that in the Pascal graph T(l) all paths 
with common endpoints are equally likely. 

Recall a bijection between the infinite binary sequences (ei, e 2 > • • • ) an d infinite directed 
paths in T started at the root (0,0). Specifically, given a path, the nth digit e n is given 
the value or 1 depending on whether I or k coordinate is increased by 1. Indentifying 
a path with a sequence (n — K n ,K n ) (where < K n < n), the correspondence can be 
written as 



Proposition 2.3. By virtue of the bijection between {0, 1}°° and the paths in V , each q- 
exchangeable sequence corresponds to a central Markov chain on T(q), and vice versa. 



T = {(l,k):l,k = 0,l,...}. 



n 
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Proof. This follows readily from Remark 12.21 Definitions 11.11 and 12.11 and the structure of 

r( g ). □ 



We shall use the standard notation 

[n] :=l + q + ... + q n -\ [n]! := [1] • [2] • • • [n] , 



[n}\ 



[k]\[n-k]\ 

for g-integers, g-factorials and g-binomial coefficients, respectively, with the usual conven- 



tion that 



for n < or k < 0. Furthermore, we set 



fc-i 



x 



Q)k ■= ~ x< ?) ' 1 - k - 00 ' 



i=0 



with the infinite product (k = oo) considered for < q < 1. 

The following lemma justifies the name of the graph by relating it to the g-Pascal 
triangle of g-binomial coefficients. 

Lemma 2.4. The sum of weights of all directed paths from the root (0, 0) to a vertex 
(n — k,k), denoted d n ^, is given by 



dn,k 



(6) 



More generally, d v ^, the sum of weights of all paths connecting two vertices (n — k, k) and 
[y — x, x) in T is given by 



jv,™ — Jx-k)(n-k) 
a n,k — 1 



v — n 
x — k 



Proof. Note that any path from (0, 0) to (n — k, k) has the second component incrementing 
by 1 on some k edges (Zj, % — 1) — > (Zj, i), where i — 1, 2, . . . , k and < l\ < • ■ • < < n — k, 
thus the sum of weights is equal to 



d 



E 



n. k :z: 7 . Q 

0<h<-<l k <n-k 



h+-+h 



(7) 



This array satisfies the recursion 

d n ,k = q n ~ k d n _ X u X + d n _ ltk , < k < n 



with the boundary conditions d nj o = d Utn = 1. On the other hand, it is well known that 
the array of g-binomial coefficients also satisfies this recursion [8] , hence by the uniqueness 
d n ^ is the g-binomial coefficient. In the like way the sum of weights of paths from {n — k, k) 
to {v — x, x) is 



d: 



E 



,,.k - 1 

n—k<l\<---<l k i<v—>c 



h+-+h 



k' 



X 



- k. 
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Comparing with ([7]) we see that this is equal to q 



(n-fc)fc' 



v — x 
k! 



□ 



Remark 2.5. Changing (/, fc) to (k, I) yields the dual g-Pascal graph V*(q), which has the 
same set of vertices and edges as T(q), but different weights: the edge (l,k) — ► (l,k + 1) 
has now weight 1, and the edge (l,k) — ► (I + has weight The sum of weights 
of paths in T* from (0,0) to (l,k) is again (jSj), which is related to another recursion for 
g-binomial coefficients, d n , k = d n -i jk -x + q k d n -\ tk . 

Consider the recursion 

v n ,k = v n+1>k + q n ~ k v n+1>k+1 , with v 0fi = 1, (9) 

which is dual to (jSJ, and denote by V the set of nonnegative solutions to Qj. 

Proposition 2.6. Formula 

¥{S n = (n - k,k)} = d njk v n , k , (n-k,k) eT 

establishes a bijective correspondence P <-> i> between the probability laws of central Markov 
chains S = (S n ) on T(q) and solutions v G V to recursion (J2]). 

Proof. Let 5 be a central Markov chain on T with probability law P. Observe that the 
property in Definition 12.11 means precisely that the one-step backward transition proba- 
bilities (that is, transition probabilities in the inverse time) are of the standard form 

¥{Sn-i = (n - 1, k) | S n = (n, k)} = ^zM = (10) 

nSn-i = (n-l,k-l)\S n = (n,k)} = d -^-^ k = g»-*M (11) 

for every such S. 

Introduce the notation 

v n>k :=F{S n = (n-k,k)}, (n-k,k)eT. (12) 

Consistency of the distributions of 5 n 's amounts to the rule of total probability 

v n . k = F{S n = (n, k) | S n+l = (n + 1, k)}v n+hk 

+ F{S n = (n, k) | S n+1 = (n + l,k+ l)}v n+lik+1 . (13) 
Rewriting ([TBI , using ffTUl) and f[TTj) . and setting 

^n,fc = d^ k v n>k (14) 
we get dUJ), which means that t> G V. Thus, we have constructed the correspondence 

Conversely, start with a solution t> G V and pass to v — (v n> k) according to (fT4l) . For 
each n consider the measure on r n with weights v Hi0 , . . . ,v Ujn . Since the weight of the 
root is 1, it follows from (Q by induction in n that these are probability measures. Again 
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by (jHJ), the marginal measures are consistent with the backward transition probabilities, 
hence determine the probability law of a central Markov chain on T(q). Thus, we get the 
inverse correspondence v t— > P. □ 

By virtue of Propositions 12.31 and 12.61 the law of g-exchangeable infinite binary sequence 
is determined by some v G V, with the entries v Uyk having the same meaning as in (j3J). In 
the sequel this law will be sometimes denoted P„. 



3. The boundary problem 

The set V is a Choquet simplex, meaning a convex set which is compact in the prod- 
uct topology of the space of functions on Y and has the property of uniqueness of the 
barycentric decomposition of each v G V over the set of extreme elements of V (see, e. g., 
[3 Proposition 10.21]). 

The boundary problem for the g-Pascal graph amounts to describing extreme nonnega- 
tive solutions to the recursion (jHJ). Each extreme solution v G V corresponds to ergodic 
process (S n ) for which the tail sigma-algebra is trivial. In this context, the set of extremes 
is also known as the minimal boundary. 

With each array v G V, v — (v n> k), it is convenient to associate another array v = (v njk ) 
related to v via fHM . Clearly, the mapping v <-> v is an isomorphism of two Choquet 
simplexes V and V = {v}. Recall that the meaning of the quantities v nk is explained in 

m. 

A common approach to the boundary problem calls for identifying a possibly larger 
Martin boundary (see [10] . [6], [I] for applications of the method). To this end, we need 
to consider multistep backward transition probabilities, which by Lemma M are given by 
a g-analogue of the hypergeometric distribution 

£ n ,Jfc(f, X ) := ^{Sn = in - k,k)\S„ = (U - X, x)} 

, k = 0,...,n, (15) 

and to examine the limiting regimes for x = x[y) as v —>■ oo, under which the probabilities 
( TT5|) converge for all fixed [n — k,k) G T. If the limits exist, the limiting array 

v n)k := lim v n k (iy, x) 

belongs necessarily to V. 

Suppose < q < 1 and introduce polynomials 

$ n ,k(x) ■= q- k(n - h) x n - k (x, q~ l ) k , $ n , fc = d n ^ n , k , < k < n. (16) 

Obviously, the degree of $ nj fc is n; we will consider the polynomial as a function on A q . 
Observe also that $ nj fc(x) vanishes at points x = q* with x < k, because of vanishing of 
{x,q~ l ) k . 



(n—k)(n—k) 



v — n 




n 




x — k 




k 


/ 
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Lemma 3.1. Suppose < q < 1, and let in ( |T5|) the indices n and k remain fixed, while 
v — > oo and x = >c{y) varies in some way with v. Then the limit of ( 1T51) is ^^(g^) if x 
constant for large enough v. If x — > oo i/ien the limit is $ n ,fc(0) = <5 n ,fc- 

Proof. Assume first x — > oo and show that the limit of (|T5|) is Since the quantities 
v n ^{y, x), where = 0, . . . , n, form a probability distribution, it suffices to check that the 
limit exists and is equal to 1 for k = n. In this case the right-hand side of (fi~5l) becomes 



n 



x — n + i\ 



, i/ - n + z 



Because hn^.-^ [m] = 1/(1 — q) for g < 1, this indeed converges to 1 provided that 
x — > oo. 

Now suppose x is fixed for all large enough v. The right-hand side of (IT5|) is for 
k> x. For k < k using linXm^oofm — j]!/[m]! = (1 — g) J we obtain 



:i - 9) ' M! ^5„,„( 9 "). (17) 



v — n 


1 


V 


[z/ — n]! 


[z/- x]! 


[x]! 


k — k 




X 


H! 


[v — x — (n — fc)]! 


[x- fc]! 



[x - fc]! 

□ 

Part (i) of the next theorem appeared in [91 Chapter 1, Section 4, Corollary 6]. Kerov 
pointed out that the proof could be concluded from the Kerov- Vershik 'ring theorem' (see 
[SI Section 8.7]), but did not give details. 

For n a measure, we shall write (jl{x) instead of fi({x}), meaning atomic mass at x. 

Theorem 3.2. Assume < q < 1. 
(i) The formulas 



establish a linear homeomorphism between the set V (respectively, V) and the set of all 
probability measures \i on A q . 

(ii) Given v G V, the corresponding measure fi is determined by 

H{q") = lim v„ tX , x = 0, 1, . . . ; /x(0) = 1 - V] fJ.(q x ). 

V—tOO ' ' 

x6{Q,l,...} 

Proof. As in [10], the assertions (i) and (ii) are consequences of the following claims (a), 
(b) and (c). 

(a) For each v = 0, 1, 2, . . . , the vertex set T u is embedded into A q via the map (u, x) i— > 
q*. Observe that, as v — > oo, the image of Y v in A q expands and in the limit exhausts the 
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whole set A q , except point 0, which is a limit point. In this sense, A q is approximated by 
the sets T u as v — > oo. 

(b) The multistep backward transition probabilities ()15p converge to ^^(g^), for < 
x < oo, in the regimes described by Lemma [3.11 

(c) The linear span of the functions $> nt k{x), (n — k,k) £ T, is the space of all polyno- 
mials, so that it is dense in the Banach space C(A q ). □ 

Note that part (ii) of the theorem can be rephrased as follows: given v £ V, consider 
the probability distribution on T n determined by v n>m and take its pushforward under the 
embedding T v ^ A q . The resulting probability measure on A q weakly converges to \i as 
n — > oo. 

Corollary 3.3. For < q < 1 we have: 

(i) The extreme elements of V are parameterised by the points x £ A q and have the 
form 

v n ,k = $n,k(x), < k < n. (18) 

(ii) The Martin boundary of the graph T(q) coincides with its minimal boundary and 
can be identified with A q C [0, 1] via the function v \— > t>i j0 . 

Proof. All the claims are immediate. We only comment on the fact the parameter x £ A q 
is recovered as the value of vx t Q\ this holds because $i ) o(x) — x. □ 

Letting q —> 1 we have a phase transition: the discrete boundary A q becomes more and 
more dense and eventually fills the whole of [0, 1] at q — 1. 

As is seen from (fl6j) . the polynomial can be viewed as a g-analogue of the 

polynomial x n ~ k (l — x) k , so that fflBl is a g-analogue of (j4j). Keep in mind that x = q* 
is a counterpart of 1 — p, the probability of E\ = 0. The following g-analogue of the 
Hausdorff problem of moments emerges. Introduce a modified difference operator acting 
on sequences u = (Mi)«=o,i,--- as 

(8 q u)i = q~\ui / = 0,1,.... 

Corollary 3.4. Assume < q < 1. A real sequence u = (m;)«=o,i,... with it = 1 is a 
moment sequence of a probability measure // supported by A q C [0, 1] if and only if u is 
'q-completely monotone' in the sense that for every k = 0, 1, . . . we have componentwise 

5 q o • ■ ■ o 5 q u > 0, k — 0, 1, 

k 

Proof. Using the notation vi + k^ = as in ([2]), we see that the recursion is equivalent 
to = <5 g M^ _1 \ cf. ()3]). Then we use the fact that $ nj o(a;) = x n and repeat in the reverse 
order the argument of Section [TJ □ 
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The case q > 1. This case can be readily reduced to the case with parameter < q < 1, 
where q := q~ l . It is convenient to adopt a more detailed notation [n] q for the g-integers. 

Lemma 3.5. For every q > 0, q = q~ x , the backward transition probabilities (flO]) . ( II II) 
for the graph T(q) and the dual graph T*(q) are the same. 

Proof. Indeed, by virtue of (TTTJT) . (fill) , this is reduced to the equality 



The lemma implies that the boundary problem for q > 1 can be treated by passing to 
q^ 1 < 1 and changing (l,k) to (k,l). In terms of the binary encoding of the path, this 
means switching 0's with l's. 

Kerov 0, Chapter 1, Section 2.2] gives more examples of 'similar' graphs, which have 
different edge weights but the same backward transition probabilities. 



A g-analogue of the Bernoulli process. Our first example is a description of the extreme 
g-exchangeable infinite binary sequences. 

With each infinite binary sequence we associate some T -sequence (T , T, T 2 , . . . ) of 
nonnegative integers, where Tj is the length of jth run of 0's. That is to say, To is the 
number of 0's before the first 1, T\ is the number of 0's between the first and second l's, 
T 2 is the number of 0's between the second and third l's, and so on. Clearly, this is a 
bijection, i.e. a binary sequence can be recovered from its T-sequence as 



If q = 1, then the Bernoulli process with parameter p has a simple description in terms 
of the associated random T-sequence: all T are independent and have the same geometric 
distribution with parameter 1 — p. 

Proposition 4.1. Assume < q < 1. For x £ A q , let v(x) = (v n ^{x)) be the extreme 
element of V corresponding to x. Consider q- exchangeable infinite binary sequence e = 
(ei,62, . . . ) under the probability law f v ( x ) and let (To,Ti, . . . ) be the associated random 
T-sequence. 

(i) If x = q* with x = 1, 2, . . . then T , . . . , T^-i are independent, = oo, and T has 
geometric distribution with parameter q*~ % for < i < x — 1 . 

(ii) If x = 1 then T = oo ; which means that with probability one e is the sequence 
(0, 0, . . .) of only 's. 

(iii) If x = then T = T = • • ■ = 0, which means that with probability one e is the 
sequence (1, 1, . . .) of only 1 ? s. 



[n ~ fc L = -fc [n ~ k h 
[n] q [n)q 



□ 



4. Examples 



(0,... ,0,1,0,. ..,0,1,0,. ..,0,1,...). 
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Proof. Consider the central Markov chain S = (S n ) corresponding to the extreme element 
v{q H ). Computing the forward transition probabilities, from (|18|) and ffTUl) . for < k < x 
we have 

nSn+i = (n+l-k,k)\S n =(n-k,k)} 

_ (q n+1 - k - 1) d n+hk <5> n+1>k (q") _ k 
(q n -l) d n , k <S> n , k (qx) 

This implies (i) and (ii). In the limit case x = corresponding to x — > +oo, the above 
probability equals 0, which entails (iii). □ 

The analogy with the Bernoulli process is evident from the above description of the 
binary sequence £{q*)- Moreover, the Bernoulli process appears as a limit. Indeed, fix 
p G (0, 1) and suppose x varies with q, as q f 1, in such a way that 

- log(l - p) 

x ~ . 

l-q 

In this limiting regime, q^ k — > 1 —p for every k, hence (T , Ti, . . . ) weakly converges to an 
infinite sequence of i.i.d. geometric variables with parameter 1—p, and the random binary 
sequence e{q >c ) converges in distribution to the Bernoulli process with the frequency of 
0's equal to 1 — p. 

Another g-analogue of Bernoulli process. Following [9J, another g-analogue of Bernoulli 
process is suggested by the g-binomial formula (see [8]) 



n 

' q k{k-i)/2 n g K 

k=0 



-0.q)„ » 



For 9 e [0, oo] we define a probability law F w e for S = (S n ) by setting 

Qk q k(k-l)/2 



w 



n ' k ' (-e,q) n 



¥ w o{S n = (n — k, k)} := d n , k wl k , (n, k) e V. (20) 



Checking (Q is immediate. Computing forward transition probabilities, 

W w e{S n +i = (n + 1 - k, k) I S n = (n - k, k)} = 1/(1 + 6q n ), 

shows that under F w e the process S n = (n — K n , K n ) has independent inhomogeneous 
increments, with probability Qq n ~ x /(l + Oq" 1 ^ 1 ) for increment K n — K n _\ = 1. For q = 1 
we are back to the ergodic Bernoulli process, but for < q < 1 the process is not extreme. 
To obtain the barycentric decomposition of w e over extremes, 

0<x<oo 
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we can apply Theorem 13. 2( ii) to compute from ( 120|) 

1 qx(x-t)/2Q3 



= lim F w e{S n = (n - x, x)} 



[-e^qU (l-g)*[x]f 
This measure \x may be viewed as a g-analogue of the Poisson distribution. 

A g-analogue of Polya's urn process. The conventional Polya's urn process is described in 
[31 Section 7.4]. Here we provide its natural deformation. 

Fix a, b > and < q < 1. Consider the Markov chain (S n ) on F with the forward 
transition probabilities from (n — k,k) to (n + 1 — fc, k) and from (n — k, k) to (n — k,k + 1) 
given by 

+ and r [a + k] 

[a + o + n\ [a + o + n\ 

respectively. Then the distribution at time n is 



¥{S n = (n-k,k)} 



q bk 



[fl][ fl +l]...[a + A ; -l][6][6+l]---[6 + n-fc-l] 
[a + b) [a + b + 1] • • • [a + b + n - 1] 



Checking consistency is easy. The conventional Polya's urn process appears in the 
limit q — > 1. The corresponding probability measure fi is computable from Theorem 
ICTii) as 

lim F{S n = (n — x, x)} 

n— >oo 

For a = 1, the limit distribution of the coordinate x is geometric with parameter 1 — q b . 
For general a, b we obtain a measure on A q 

( yc\ (q a ,q)*(q b ,q)o ^ x _ A 
g)x(g a+b ; ?)oo 

which may be viewed as a g-analogue of the beta distribution on [0, 1]. 



5. Grassmannians over a finite field 

For q a power of a prime number, let W q be the Galois field with q elements. Define V n 
to be the n-dimensional space of sequences £ 2 , • • •) with entries from F 9 , which satisfy 
£i = for i > n. The spaces {0} = Vo C V± C V% C . . . comprise a complete flag, and the 
union := U n >oV^ is a countable, infinite-dimensional space over ¥ q . 

By the Grassmannian Gr(\4o) we mean the set of all vector subspaces X C V^. Like- 
wise, for n > let Gr(V^) be the set of all vector subspaces in V n , with Gr(Vo) being a 
singleton. Consider the projection ir n+ i >n : Gr(V^ + i) — >■ Gr(V n ) which sends a subspace of 
V n+ i to its intersection with V n . 
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Lemma 5.1. There is a canonical bijection X «-> (X n ) between the Grassmannian 
Gr(V^o) and the set of sequences (X n G Gr(14), n > 0) satisfying the consistency condi- 
tion X n = 7r n+ i in (X„ + i) for each n. 

Proof. Indeed, the mapping X i— > (X n ) is given by setting X n = X nV n for each n, while 
the mapping (X n ) h- > X is defined by X = UX n . □ 

The lemma shows that Gr(V^o) can be identified with a projective limit of the finite 
sets Gr(V^), the projections being the maps 7r n+l n . Using this identification we endow 
Gr(Voo) with the corresponding topology, in which Gr(T^o) becomes a totally disconnected 
compact space. For X G Gr(V r oc ), a fundamental system of its neighborhoods is comprised 
of the sets of the form {X' G GrfVoo) : X' n = X n }, where n = 1, 2, . . . . 

Let 5f n = GL(n,¥ q ) be the group of invertible linear transformations of the space 
V n , realised as the group of transformations of Voo which may only change the first n 
coordinates. We have then {e} = ^ C C 5^ C . . . and we define := U£f n . The 
countable group consists of infinite invertible matrices (gij), such that g^- = 5ij for 
large enough i + j. The group ^ acts on hence also acts on Gr(V A oc ). 

A probability distribution on Gr(V r oc ) defines a random subspace of V^. We look at 
random subspaces of whose distribution is invariant under the action of ^x,. Observe 
that the action of £f n splits Gr(Ki) into orbits 

G(n, k) = {X e Gr(K), dimX = k}, < fc < n, 

where #G(n, k) = d n ^ is the number of k- dimensional subspaces of V n . Therefore, a prob- 
ability distribution on Gr(V^o) is ^oo-invariant if and only if the conditional distribution 
on each G(n, k) is uniform. 

It must be clear that this setting of '(/-exchangeability' of linear spaces is analogous 
to the framework of de Finetti's theorem: exchangeability of a random binary sequence 
means that the conditional measure is uniform on sequences of length n with k l's. See 
PQ, pj for more on symmetries and sufficiency 

Lemma 5.2. Formula 

~v nM = P{X G Gr(y oo ) : X n V n G G(n, k)}, (n, k)eT 

establishes a linear homeomorphism between V and ^-invariant probability measures on 
the Grassmannian Gr(V r oc ). 

Proof. We first spell out more carefully the remark before the lemma. Consider projections 

vToc.n : Gr(V oc ) -> Gr(K), X ^ X f)V n , X G Gr^), n = 1, 2, . . . . 

If P is a Borel probability measure on the space Gr(V^o), then, for any n, the pushforward 
P n '■= ftoo,n(P) is a probability measure on Gr(V n ), and the measures P n are consistent 
with respect to the projections 7r n+ i jn , that is, 

Pn — 7r n+l,n(-Pn+l), 71 = 0, 1, 2, . . . . 



A g- ANALOGUE OF DE FINETTI'S THEOREM 



13 



Conversely, if a sequence (P n ) of probability measures is consistent, then it determines 
a probability measure P on Gr(V r oc ). Moreover, P is ^^-invariant if and only if each P n 
is ^-invariant. Next, observe that if P n is a ^-invariant probability measure, then it 
assigns the same weight to each fc-dimensional space X n G G(n,k); let us denote this 
weight by v n>fc . 

Fix X n G G(n, k). We claim that there are precisely q n ~ k +l subspaces X n+1 G Gr(T4,+i) 
such that X n+ i f)V n = X n : one subspace from G(n + l,k) and q n ~ k subspaces from 
G(n + 1, k+1). Indeed, dimX n+1 equals either k or k+1. In the former case X n+1 = X n , 
while in the latter case X n+1 is spanned by X n and a nonzero vector from V n+ i \ V n . Such 
a vector is defined uniquely up to a scalar multiple and addition of an arbitrary vector 
from X n . Therefore, the number of options is equal to the number of lines in V n+ \/X n 
not contained in V n /X n , which equals 

„n+l— k i „rt— fe i 

y * _ y ^ __ ^n-ft 

g — 1 q — 1 

Now, let P be a ^oo-invariant probability measure on Gr(V r oc ), with projections (P n ) 
specified by the corresponding array of weights v = (f n ,fc)- Then the relations P n = 
7Tn+i,n(-fn+i) together with the dimension computation imply that v satisfies (JSJ). 

Conversely, given v G V, we can construct a sequence (P n ) of measures such that P n 
lives on Gr(V^), is invariant under ^ n and agrees with P n+ \ under 7r n+l n . Since P , which 
lives on a singleton, is obviously a probability measure, we obtain by induction that all P n 
are probability measures. Taking their projective limit we get a ^o-invariant probability 
measure P on Gr(V r oc ). □ 

Rephrasing Theorem 13.21 we have from the lemma 

Corollary 5.3. The ergodic ^oo-invariant probability measures on Gr(Voo) are parame- 
terised by >c e {0, 1, ... , oo}. For x = the measure is the Dirac mass at V^, for x = oo 
it is the Dirac mass at V , and for < x < oo the measure is supported by the set of 
subspaces of of codimension x. 

The following random algorithm describes explicitly the dynamics of the growing space 
X n G Gr(V n ) as n varies, under the ergodic measure with parameter x. Recall the notation 
q = q~ l . Start with X = V . With probability q* choose Xi = Vi, and with probability 
1 — q* choose X\ = X . Suppose X n C V n has been constructed and has dimension n — k 
with k < k. Then let X n+ i = X n with probability 1 — q >c ~ k , and with probability q H 
choose uniformly at random a nonzero vector £ G Ki+i \ Vn and let X n+ \ be the linear 
span of X n and £. 

Duality. We finish with a dual version of our construction. Let V°° denote the set of all 
sequences r\ = (rji, r]2, ■ ■ ■ ) with entries from ¥ q . This is again a vector space over ¥ q , 
strictly larger than Voo since we do not require rj to have finitely many nonzero entries. 
That is to say, V°° is just the infinite product space (F^) 00 , which we endow with the 
product topology. Let Gr(V°°) denote the set of all closed subspaces Y C V°° . A dual 
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version of Lemma 15.11 says that such subspaces Y are in a bijective correspondence with 
the sequences (Y n £ Gr(V n ),n > 0) such that Y n = 7r' n+1 n (Y n+1 ), where n' n+1 n is induced 
by the projection map V n+ \ — > V n which sets the (n+ l)th coordinate of a vector £ £ V n+ \ 
equal to 0. The branching of G(n, fc)'s under these projections corresponds to the dual 
g-Pascal graph. 

Lemma 5.4. The operation of passing to the orthogonal complement with respect to the 
bilinear form 

oo 

i=i 

is a bijection Gr(l / 00 ) <-> Gr(V°°). 

Proof. First of all, note that the bilinear form is well defined, because the coordinates £j of 
£ £ K» vanish for z large enough. This form determines a bilinear pairing x V°° — > F q . 
We claim that it brings the spaces V^ and into duality, where V°° is viewed as a 
vector space with nontrivial topology, and the topology on is discrete. 

Indeed, it is evident that the pairing is nondegenerate and that any linear functional 
on Voo is given by a vector of V 00 . A minor reflection also shows that, conversely, any 
continuous linear functional on V°° is given by a vector from V^. Thus, the spaces Voo and 
V°° are indeed dual to one another. They are also dual as commutative locally compact 
topological groups: one is discrete and the other is compact. 

Using the duality, it is readily checked that if X is an arbitrary subspace in Voo-, then 
its orthogonal complement X 1 - is a closed subspace in V°°, whose orthogonal complement 
(X- 1 ) 1 - coincides with X. Likewise, starting with a closed subspace Y C V°°, we have 
Y x Q Voo and {Y- 1 ) 1 - = Y. Thus, the operation of taking the orthogonal complement is 
a bijection. □ 

The group acts on both and V°° and preserves the pairing between these vector 
spaces. Under the identification Gr(V°°) = Gr(V r oc ), the group ^ acts by homeomor- 
phisms on this compact space. In the dual picture, the ergodic measures with x < oo live 
on the set of x-dimensional subspaces of V°° . The case x = oo corresponds then to the 
zero subspace in Voo (or the full space V°°). There is a simple explanation why we have to 
fix codimension in the Voo-picture and dimension in the V^-picture, and not vice versa. 
Namely, the subspaces in Voo of fixed nonzero finite dimension form a countable set, which 
is a single ^oo-orbit, and such a ^-space cannot carry a finite invariant measure. 
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